Rank in Wordlist | Frequency | Word |
---|---|---|
8949 | 42 | 1,5 |
9410 | 39 | 1,000 |
9412 | 39 | 10,000 |
10997 | 31 | 100,000 |
11478 | 29 | 30,000 |
12047 | 27 | 2,5 |
13014 | 24 | 5,000 |
14203 | 21 | 20,000 |
14672 | 20 | 50,000 |
15141 | 19 | 2,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
4693 | 101 | beoordeling(en |
16572 | 17 | compilation(s |
24245 | 10 | time(s |
24426 | 9 | 747minuter(för |
31201 | 7 | user(s |
36824 | 5 | PVG(B |
44176 | 4 | bed(s |
45643 | 4 | kommentarer(0 |
52265 | 3 | Oscar(R |
57844 | 3 | person(s |
Rank in Wordlist | Frequency | Word |
---|---|---|
1714 | 337 | 0)My |
4577 | 104 | 0)Mijn |
22820 | 10 | %) |
29594 | 7 | Tafsir)Tafsir |
31324 | 6 | %). |
44469 | 4 | cm). |
47699 | 3 | %), |
52349 | 3 | Paris)… |
54795 | 3 | brave). |
60174 | 2 | $) |
Rank in Wordlist | Frequency | Word |
---|---|---|
2723 | 198 | 100% |
5297 | 86 | 50% |
6611 | 64 | 20% |
6617 | 64 | 90% |
8140 | 48 | 10% |
8395 | 46 | 80% |
9415 | 39 | 25% |
10574 | 33 | 30% |
10577 | 33 | 5% |
11239 | 30 | 75% |
Rank in Wordlist | Frequency | Word |
---|---|---|
5832 | 76 | H&M |
9440 | 39 | R&B |
11794 | 28 | ID&T |
14208 | 21 | A&R |
16495 | 17 | W&W |
17062 | 16 | Lova&jag |
17853 | 15 | R&D |
18449 | 14 | A&B |
18711 | 14 | Q&A |
20411 | 12 | AT&T |
Rank in Wordlist | Frequency | Word |
---|---|---|
11740 | 28 | $100 |
13772 | 22 | $50 |
14197 | 21 | $1 |
14198 | 21 | $10 |
14199 | 21 | $20 |
15686 | 18 | $200 |
15687 | 18 | $500 |
16260 | 17 | $300 |
16261 | 17 | $5 |
18414 | 14 | $25 |
Rank in Wordlist | Frequency | Word |
---|---|---|
210 | 2622 | I'm |
223 | 2392 | it's |
269 | 1996 | don't |
371 | 1461 | It's |
538 | 1020 | I've |
691 | 809 | didn't |
723 | 775 | you're |
737 | 761 | can't |
754 | 748 | that's |
803 | 706 | doesn't |
Rank in Wordlist | Frequency | Word |
---|---|---|
87137 | 1 | %&* |
Rank in Wordlist | Frequency | Word |
---|---|---|
2885 | 186 | S/Y |
3517 | 146 | and/or |
8335 | 47 | he/she |
9298 | 40 | OEPP/EPPO |
9411 | 39 | 1/2 |
10066 | 36 | his/her |
11001 | 31 | A:0/A:0 |
11390 | 30 | km/h |
12355 | 26 | 24/7 |
14217 | 21 | Apache-Coyote/1.1 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots